Helping Our Own: Text Massaging for Computational Linguistics as a New Shared Task
نویسندگان
چکیده
In this paper, we propose a new shared task called HOO: Helping Our Own. The aim is to use tools and techniques developed in computational linguistics to help people writing about computational linguistics. We describe a text-to-text generation scenario that poses challenging research questions, and delivers practical outcomes that are useful in the first case to our own community and potentially much more widely. Two specific factors make us optimistic that this task will generate useful outcomes: one is the availability of the ACL Anthology, a large corpus of the target text type; the other is that CL researchers who are non-native speakers of English will be motivated to use prototype systems, providing informed and precise feedback in large quantity. We lay out our plans in detail and invite comment and critique with the aim of improving the nature of the planned exercise.
منابع مشابه
SciSumm 2017: Employing Word Vectors for Identifying, Classifying and Summarizing Scientific Documents
This paper describes our approach on ”Recognizing Reference Spans,Classifying Their Discourse Facets and Summarizing from Reference Text” as an attempt in the shared task on relationship mining and scientific summarization of computational linguistics research papers at SIGIR 2017.
متن کاملACL 2016 The 54th Annual Meeting of the Association for Computational Linguistics Proceedings of the SIGNLL Conference on Computational Natural Language Learning: Shared Task
The CoNLL-2016 Shared Task is the second edition of the CoNLL-2015 Shared Task, now on Multilingual Shallow discourse parsing. Similar to the 2015 task, the goal of the shared task is to identify individual discourse relations that are present in natural language text. Given a natural language text, participating teams are asked to locate the discourse connectives (explicit or implicit) and the...
متن کاملUniversity of Illinois System in HOO Text Correction Shared Task
In this paper, we describe the University of Illinois system that participated in Helping Our Own (HOO), a shared task in text correction. We target several common errors, such as articles, prepositions, word choice, and punctuation errors, and we describe the approaches taken to address each error type. Our system is based on a combination of classifiers, combined with adaptation techniques fo...
متن کاملComparative Study of Neural Models for the COSET Shared Task at IberEval 2017
This paper describes our participation in the Classification Of Spanish Election Tweets (COSET) task at IberEval 2017. During the searching process for the best classification system, we developed a comparative study over possible combinations of corpus preprocessing, text representations and classification models. After an initial models exploration, we focus our attention over specific neural...
متن کاملProducing a Persian Text Tokenizer Corpus Focusing on Its Computational Linguistics Considerations
The main task of the tokenization is to divide the sentences of the text into its constituent units and remove punctuation marks (dots, commas, etc.). Each unit is a continuous lexical or grammatical writing chain that is an independent semantic unit. Tokenization occurs at the word level and the extracted units can be used as input to other components such as stemmer. The requirement to create...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010